A Comparative Study of the Multi-Layer Perceptron, the Multi-Output Layer Perceptron, the Time-Delay Neural Network and the Kohonen Self-Organising Map in an Automatic Speech Recognition Task
نویسندگان
چکیده
This paper describes a study of the use of four different neural network techniques for automatic speech recognition (ASR) using two common, real-world application databases. The neural network techniques investigated were the Multi-Layer Perceptron (MLP), the Multi-Output-Layer Perceptron (MOLP), which is an improved version of the MLP , the Time-Delay Neural Network (TDNN) and the Kohohnen Self-Organising Map (SOM). The speech test data consisted of a clean database, acquired in a relatively noise-free room environment, and a telephone database, acquired over conventional dial-up lines. Each database comprised 20 repetitions of 12 isolated words (the digits 0 9 plus ‘nought’ and ‘oh’) each spoken by 25 talkers. Each word was parameterised into a time sequence of 15 frames of an 18-dimension feature vector, consisting of 8 Melfrequency Cepstral Coefficients (MFCCs), the corresponding frame-to-frame MFCC differential coefficients and absolute and differential signal energy coefficients. In a speaker-independent, isolated-word speech recognition task, the respective recognition scores for the MLP, MOLP, TDNN and SOM were 93.2%, 95.5%, 95.1%, and 97.1% respectively for the clean speech database , and 76.3%, 90.5%, 90.5% and 96.8% respectively for the telephone database.
منابع مشابه
New full adders using multi-layer perceptron network
How to reconfigure a logic gate for a variety of functions is an interesting topic. In this paper, a different method of designing logic gates are proposed. Initially, due to the training ability of the multilayer perceptron neural network, it was used to create a new type of logic and full adder gates. In this method, the perceptron network was trained and then tested. This network was 100% ac...
متن کاملThe Application of Multi-Layer Artificial Neural Networks in Speckle Reduction (Methodology)
Optical Coherence Tomography (OCT) uses the spatial and temporal coherence properties of optical waves backscattered from a tissue sample to form an image. An inherent characteristic of coherent imaging is the presence of speckle noise. In this study we use a new ensemble framework which is a combination of several Multi-Layer Perceptron (MLP) neural networks to denoise OCT images. The noise is...
متن کاملConcurrent Neural Networks for Speaker Recognition
We propose a new recognition model called Concurrent Neural Networks (CNN), representing a winner-takes-all collection of neural networks. Each network of the system is trained individually to provide best results for one class only. We have applied the above model for the task of speaker recognition. We performed distinct speaker recognition experiments using three variants of basic components...
متن کاملAn Automated MR Image Segmentation System Using Multi-layer Perceptron Neural Network
Background: Brain tissue segmentation for delineation of 3D anatomical structures from magnetic resonance (MR) images can be used for neuro-degenerative disorders, characterizing morphological differences between subjects based on volumetric analysis of gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF), but only if the obtained segmentation results are correct. Due to image arti...
متن کاملPerformance comparison of land change modeling techniques for land use projection of arid watersheds
The change of land use/land cover has been known as an imperative force in environmental alteration, especially in arid and semi-arid areas. This research was mainly aimed to assess the validity of two major types of land change modeling techniques via a three dimensional approach in Birjand urban watershed located in an arid climatic region of Iran. Thus, a Markovian approach based on two suit...
متن کامل